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Abstract 

An infinite word x is said to be quasiperiodic if there exists a finite word q such that 
x is covered by occurrences of q (such a q is called a quasiperiod of x). Using the 
Q . notion of derivation, we show that this definition is not sufficient to imply any sym- 

metry in an infinite word. Therefore we introduce multi-scale quasiperiodic words, 
i.e. quasiperiodic words that admit an infinite number of quasiperiods. Such words 
are uniformly recurrent, this allows us to study the subshift they generate. We 
prove that multi-scale quasiperiodic subshifts are uniquely ergodic and have zero 
topological entropy as well as zero Kolmogorov complexity. Sturmian subshifts are 
shown to be multi-scale quasiperiodic. 
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1 Introduction and notations 



1.1 Background: quasiperiodicity and symmetry 

The general concept of symmetry is central in the study of infinite words, and many 
notions are devoted to measure it, like word complexity, recurrence, Kolmogorov's com- 
plexity, entropy, .... Those notions are more or less linked and in any case the most 
symmetric words are the periodic ones. 

In |Marlj . the second author introduced a new notion to describe symmetric words: 
quasiperiodicity. This notion is an extension of the notion of quasiperiodicity for finite 
words that was introduced in |ApoEhr| and has also roots in different contexts like mu- 
sicology [CrllRaj and molecular biology |KMGL| |Mil.Tur| . 

Let A be a finite set (called alphabet). Let x = xqX\X2 ... in A N be an infinite word. We 
will say that x is quasiperiodic if there exists a finite word q in A* such that x is covered 
by the occurences of q (A* denotes the set of finite words on the alphabet A). Such a q 
is called a quasiperiod of x ; it is a prefix of x. 

More precisely, let (z n ) ne N be the increasing sequence whose image is the set {i G 
N | = q}, where l(q) denotes the length of q, and x^j denotes 

(for < i < j). The word q is a quasiperiod of x whenever (i n ) is a well defined infinite 
sequence such that i = and i n+ i — i n < l(q) for any integer n. 

For example, the infinite word x = ababaabaabaababababaabababaabaabaababaaba. . . is 
quasiperiodic with aba as a quasiperiod and the infinite quasiperiodic words that admit 
aba as a quasiperiod are precisely the words over the alphabet {a, b} beginning with the 
word ab and in which the words aaa and bb do not appear. 

The second author was then looking for relations with other notions of symmetry. In 
|Marl| and jLevRiclj . it is shown that there is no such relation. Let us introduce a basic 
tool that will unify those results and allow us to understand this lack of fitting between 
quasiperiodicity and other notions: the derivation. 

1.2 The derivation: a change of scale 
1.2.1 Derivation 

Let x = xqX\X2 ... be a quasiperiodic word, and q be a quasiperiod of x. Let {i n )nm be 
the sequence as defined before. We can define || to be the infinite word on the alphabet 
{0, . . . , l(q) — 1} whose k th letter is l(q) — i^+i + ik {k > 0). 

In other terms, the k th letter of || is the length of the overlap between the k th and the 
(k + l) th occurrence of q in x. 

For example, if x = ababaabaabaababababaabababaabaabaababaaba. . . is quasiperiodic 
with aba as a quasiperiod, then /f- = 100011101100010. . . . 
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1.2.2 Integration 

In some sense, the operator removes exactly the information contained in q since the 
knowledge of || and q is sufficient to reconstruct x. 

Indeed, we can consider the reverse operation : the integration. If x is an infinite word on 
a finite alphabet AcN and if w is a finite word whose length is greater than max A, then 
we can define the word J x as the image of x under the substitution a w that replaces 
the occurences of i G A by the l(w) — i first letters of w. 

For example, if x = 01121010201... and w = aabcaa, 

then f x = aabcaaaabcaaabcaaabcaabcaaabcaaaabcaaabcaaaabcaabcaaaabcaaabcaa.... 

If w is such that for each n in A, w is a prefix of a w (n).w, then f x is quasiperiodic 
with w as a quasiperiod, no matter how x is random. This is the case for example if 
Ac {0, . . . , n} and w = a n ba n (n > 0) . 

1.2.3 Quasiperiodicity is spread everywhere 

Recurrence An infinite word x is said to be recurrent if any finite word u appearing in 
x appears infinitely often in x. It is said to be uniformly recurrent if moreover for 
each finite word u appearing in x, the gap between two consecutive occurences of 
u in x is bounded. 

To construct a non recurrent quasiperiodic word, just take a non recurrent word x 
on the alphabet {0, 1} and consider J aba x. Do the same to construct a uniformly 
recurrent quasiperiodic word. 

Minimality is the analogue of uniform recurrence in the vocabulary of topological 
dynamical systems (see section |2|). 

Complexity For any infinite word x and any integer n, L n (x) denotes the finite words 
of length n that occur in x and we define L(x) = [J nl - N L n (x). 

The function that sends an integer n to p n (x) = card(L„(x)) is called the word 
complexity of x. 

We can define an equivalence relation on the set of infinite words through the 
asymptotic behaviour of their complexity function: two words x and y are said 
to be complexity equivalent if there exists a positive integer K such that for all 
n>l, p n ( x ) < KpKn{y) andp n (y) < Kp Kn [x). Hence, bounded, linear, quadratic, 
polynomial or exponential growths are preserved under complexity equivalence. 

Theorem 1. There are quasiperiodic words in any class of complexity equivalence. 

Proof: Let x be an infinite word on an alphabet A. There is no restriction to 
suppose that A = {0, . . . , k}. Let y = j akba k x. We check that y is a quasiperiodic 
word in the class of complexity equivalence of x. □ 

The non ultimately periodic words with the smallest word complexity are the stur- 
mian words: an infinite word is said to be sturmian if for any integer n, p n (x) = n+1. 
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In |LevRicl) . Florence Leve and Gwenael Richomme proved that there exists stur- 
mian words that are not quasiperiodic. 

Entropy is the exponent of the complexity function in the vocabulary of topological 
dynamical systems (see section OJ). 

Frequencies If u and v are finite words, let #(u,v) denotes the number of occurences 
of u in v. An infinite word x is said to have frequencies if for any finite word w, 
xo^n-i) admits a limit when n tends to infinity. 

By integration, there exists quasiperiodic words that do not have frequencies: if x 
is an infinite word such that does not appear with frequencies, then J aba x is a 
quasiperiodic word such that bob does not appear with frequencies. 

Unique ergodicity is a strong analogue of having frequencies in the vocabulary of 
topological dynamical systems (see section 0]). 

Hence the notion of quasiperiocicity does not insert well among other notions of symmetry. 

1.3 Multi-scale quasiperiodic words 

We have to notice that all classical notions of symmetry are invariant under such a renor- 
malization procedure (derivation corresponds to induction if we are studying dynamical 
properties like entropy |Ahrj ). Therefore, if we want that a notion says something about 
symmetry, we should ensure that it is stable under such a change of scale. As we saw 
with the derivation procedure, the existence of a quasiperiod q in a word x just imposes 
rigidity at the scale around l(q) but does not impose anything at larger scales. 
This leads to the following definition : an infinite word is said to be multi-scale quasiperi- 
odic if the set Q(x) of its quasiperiods is infinite. 

The easiest non-periodic multi-scale quasiperiodic words we can construct, are the fixed 
points for some particular integration operators. For example, the fixed point of J QlQ is 
multi-scale quasiperiodic, it is known as the Fibonacci word (a precise description of the 
quasiperiods of this word can be found in |LevRicl| ). There exists much wilder multi- 
scale quasiperiodic words (see Theorem E] in sectional). 

In the next section we will prove that multi-scale quasiperiodic words are uniformly recur- 
rent. This will allow us to study the subshift generated by them (section [2j. Concerning 
the complexity, we will prove that multi-scale quasiperiodic subshifts have zero topolog- 
ical entropy as well as zero Kolmogorov complexity (section We will also prove that 
sturmian subshifts are multi-scale quasiperiodic (section El) . Concerning frequencies, we 
will prove that multi-scale quasiperiodic subshifts are uniquely ergodic (section 0J . 
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2 Uniform recurrence and minimality 

An infinite word x G A n is said to be uniformly recurrent if any finite word u G L(x) 
occurs infinitely many times in x and the gap between two consecutive occurrences of u 
in x is bounded, equivalently 

VueL{x) 3n>l Vf G L n (x) #(u,v) > 1 

Theorem 2. ^4n?/ multi-scale quasiperiodic word x is uniformly recurrent. 

Proof: Let u be a finite word that occurs in x. Since every quasiperiod of x is a prefix of 
x, one of them must contain an occurrence of u (they have unbounded length). Let q be 
such a quasiperiod. Any word in L 2 i( q ) contains at least an occurrence of q and therefore 
at least an occurrence of u. □ 

This property of multi-scale quasiperiodic words allows us to deal with the dynamical 
system generated by a multi-scale quasiperiodic word as follows: 

We endow A with the discrete topology and A n with the product topology. This makes 
A n a metrisable compact space. 
We note 



and we still note S for the restriction of S to X, making (X, S) a topological dynamical 
system. 

Theorem El is equivalent to say that (X, S) is a minimal subshift i.e. X is a nonempty 

closed subset of A n stable under S and that is minimal for those properties. 

A minimal subshift generated by a multi-scale quasiperiodic word is called a multi-scale 

quasiperiodic subshift. 

If u is a finite word, we define the cylinder 



We can notice that the derivation of a multisacled quasiperiodic word over the quasiperiod 
q corresponds to the induction of the subshift (X, S) on the cylinder [q]. 
We extend the notion of language and complexity to minimal subshifts: 




for the shift. It is a continuous map. 

If x is a multi-scale quasiperiodic word, we define 



X = {S k (x) | k G N} 



[ u ] = { x eX | (Vi < n - l)(xi = Ui)} (u = u ... u n ^ G A n (n G N)) 



L n (X) = {u G A n | [u] ^ 0} and p n (X) = card(L n (X)) (n G N) 



L(X) = |J L n (X) 



If y is in X, we have L(y) = L(X) and p.{y) = P.PO- 
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3 Complexity and topological entropy 



3.1 Word complexity 

Theorem 3. Let x be a multi-scale quasiperiodic word. Then 

y ■ f Pn{x) 

lim ml — — < 1 < oo 

n^oo n 

Proof: Let q G Q(x) and u G L t ^(x). Since q is a quasiperiod of x, u is a subword of 
some v.q where v is nonempty prefix of q. u is determined by the choice of v and his 
position in v.q. There are /(g) prefixes of q and for such a prefix v, there are l(v) < l(q) 
available positions for u (we do not count q several times). Finally, there are only /(g) 2 
possibilities for u and pi( q ){x) < /(g) 2 . The result follows since {/(g) | g G Q(x)} is not 
bounded. □ 



Theorem 4. For each positive function f : N — > that converges to zero, there exists 
a multi-scale quasiperiodic subshift (X,S) such that - \og(p n (X)) > f(n) for infinitely 
many n. In particular, we can ask to p n (X) to grow faster than any polynomial on a 
subsequence. 

Proof: Let tp : N — > N* be an increasing sequence such that 2nf(2nip(n)) < 1 for any n. 
Let A = {0, 1} be the alphabet. For n > 1, let w n be a finite word on A such that 

• w n begins and ends with the letter 0, 

• every word w of length 2n such that #(0, w) = #(1, w) = n appears as a subword 
of w n . 

Now, let us define by induction a sequence of finite words over A: 

• u = 010 

• u n+l = j Un «v ( /(„ n) ) 

which is well defined since for any n, u n begins and ends with the letter 0. 

Since u n = f u 0, u n is a prefix of u n+ i m . let x be the unique infinite word over A such 

that u n is a prefix of x for any n. 

If k < /, ui is covered by occurences of Uk, so all the u n are quasiperiods of x and x is 

multi-scale quasiperiodic. Let (X, S) denote the associated minimal subshift 

Let n be a positive integer. u n+ \ and therefore x contain an occurence of f u w, where w is 

any word of length 2ip(l(u n )) such that #(0,w) = #(l,iu) = </?(/(u n )). There are at least 

2 ( ^( / («n)) diff eren t words and each of them has length ip(l(u n ))(l(u n ) + (l(u n ) — 1)). 

Hence, 

V2<p(l(u n ))l{u n )(X) > P v (l(u n ))(l(u n )+(l(u n )-l))(X) > 2^ l{Un)) 

So, 

2,,(/K )) /K ) l ^M^ M u n) (x)) > 2^§§o = 21^) " ^ Z W) Z W) 

□ 
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We can notice that for the infinite words constructed here, the scales controlled by the 
quasiperiods are very sparse. 



3.2 Topological entropy 

If (X, S) is a minimal subshift, then the limit 

h top (X) = lim -log(> n (X)) 
exists and is named the topological entropy of X. 

Corollary 1. Any multi-scale quasiperiodic subshift has zero topological entropy. 

3.3 Kolmogorov complexity 

Let U be a fixed universal Turing machine and for each finite word u 6 A*, let K v (u) 
denotes the Kolmogorov complexity associated to u, i.e. the length of the shortest binary 
word p such that U(p) = u (see |Bru] ). For a minimal subshift (X, S), we can define 

K{X) = sup hm sup 



(this number is independant of the choice of U since if U' is another universal Turing 
machine, there is a constant C such that for any finite word u, Ku(u) — C < Kw(u) < 
Ku(u) + C). 

Corollary 2. For any multi-scale quasiperiodic subshift (X,S), K(X) = 0. 
Indeed, jBruj Theorem 3.1 asserts that K(X) < h top (X). 



But we can also give a more direct argument : if q is a fixed quasiperiod of a multi-scale 
quasiperiodic subshift (X, S) and if x is in X, then limsup^oo Ku ^~* n ^i < li2l2M)_ To 
prove this, it suffices to remark that the integration algorithm can be coded in 0(1). 
Then, if n is bigger than 4Z(g), there exists three finite words u, w and v such that 
X\^ n = u.w.v, the lengths of u and v are uniformly bounded by /(g), and w begins and 
ends with q (and can therefore be derivated). 

The remaining problem is the control of the length of dw/dq: if the occurences of q 
overlap each other deeply, dw/dq can be rather long. To solve this, we decide to replace 
recursively an occurence of n\.ni in dw/dq by ni + if n\ and n2 are smaller than n/2. 
This operation consists in omitting some useless occurences of q in x\-+ n . Then at least 
half of the numbers appearing in the new form of dw/dq are bigger than l(q)/2, so the 
length of this new dw/dq will be less than 4n/l(q). 

Since the coding of each letter of dw/dq costs log(/(g)), we have a total cost less than 
a constant (to code the integration algorithm, q, u and v) plus log(/(g))4n//(g) (to code 
the new form of dw/dq). 
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4 Unique ergodicity and frequencies 



For a multi-scale quasiperiodic subshift (X, S), we will now study the set A4(X,S) of 
Borel probability measures on X that are invariant under S. This set can be identi- 
fied with a nonempty compact convex subset of C°(X, R)' endowed with the weak-star 
topology. 

A ^-invariant measure /i G M. (X, S) is said to be ergodic if the only Borel sets A C X 
such that S~ 1 (A) = A have measure fi(A) =0 or 1. Such measures are the extremal 
points of A4(X, S) and satisfies Birkhoff's theorem : 



fdfi 



k=0 JX 

A minimal subshift is said to be uniquely ergodic if card(.M(X, S)) = 1. 
One interest of such a situation is that, the unique invariant measure \i is ergodic, more- 
over the convergence in Birkhoff's theorem is uniform for continuous functions. 

Theorem 5. Any multi-scale quasiperiodic subshift (X, S) is uniquely ergodic. 

Proof: 

We will first construct a ^-invariant probability measure on X and then prove that it is 
the only one. 

Step 1: We construct a candidate to be the unique measure. For this, we will approxi- 
mate X by periodic subshifts generated by the q^ = qqq ... for q G Q(X). 
For q in Q(X), let 

def J- \ ^ r. 

(S stands for the one-point Dirac's measure). 

\i q is the only element of Ai(A n , S) that gives mesure 1 to the periodic subshift generated 
by the periodic word q" . By compacity we can find an infinite subset Q' C Q(X) such 
that 

fig > fl 

q&Q' 

for some /i in Ai(A n , S). 

Note that if X is aperiodic the /z g 's give measure to X. However \i will give strictly 
positive measure to X (as we will see in Step 2) : it shouldn't be surprising since the 
characteristic function of X is not continuous. 



Step 2: Let us show that = 1. Since X is closed, we have the following approxi- 

mation by clopen sets: 

x=z =n u M- 

n>l u£L n (X) 

Let n > 1 and let q G Q' such that l(q) > n. For i G {0,...,l(q) — n}, we have 
QtLi+n-i e L n(X) (as a subword of q). Hence /^(LLl„(x)M) > (Ko) ~ n + !)/%)■ 
Letting l{q) tending to infinity, since the caracteristic function of LLeL n (x)M ls contin- 
uous, we have MLLgl (-X")M) = ^ ^ countable intersection (n is arbitrary), we have 
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fJ>(X) = 1. Hence, we can still denote by \i for the restriction of \i to X. 



Step 3: Let v be an ergodic measure on X. We will show that v — /j, 
By Birkhoff's theorem, there is x in X such that for u G L(X), 

1 1 
v([u\) = lim -#(M,x ^„ + i( u )- 2 ) = hm -#(u, xo-»„-i) 

Let g in Q' such that /(g) > /(w). We decompose x into blocks of length 21 (q): x = 
B0.Bx.B2.B3.B4. . . with Bi = x 2 i( q )i^2i(q)(i+i)-i- Since each B { is in L 2 i( q ){X), it contains 
at least one occurence of q, hence #(tt, Bq.Bi.B 2 .- • •■B i ) > (i + g) for each i in N*. 

We have 

KN) = ^^flo-Sx^,.,^) > }^ 2l{q){^lf M = W)*^ q) 

Moreover, we can control the frequency of occurrences of u in g w by estimating the 
occurrences of u in q and bounding the number of occurrences of u that appear between 
two consecutive occurences of q in q w : 

Therefore, 

Letting /(g) tend to infinity, we have /i([w]) < 2i/([u]). So, /x is absolutely continuous 
relatively to v. It is well known that this implies /i — v, but for sake of completeness, 
we include a short proof here. There exists a measurable function / e L X (X, R + ) such 
that for each borel set A C X , fJ>(A) = f A fdv. Since \i is S-invariant, we have f A fdjj, = 
Is- 1 a fdfi for each borel set A C X . 

Let us show that / is constant almost everywhere. Assume by contradiction that the 
measure of set A = {x<E X/f(x) > J x fdv} is in ]0, 1[. Since v is S-ergodic, A is not 
S-invariant, so v(S~ 1 (A) \ A) = v(A \ S~ X {A)) > 0. 

Hence v(A \ S'^A)) J x fdv < J AXS - 1(A) fdv = J A fdv - J AnS - 1{A) fdv = J S _ 1(A) fdv - 

Lns-HA) f dv = Is-HA)\a f dv < »( A \ S ~ X ( A )) fx f dv which is absurd. 

Hence / is constant with value v(X) — 1, so /i — v and \x is the only S'-invariant measure. 

□ 



Corollary 3. Let x be a multi-scale quasiperiodic word. Then each finite wordu occuring 
in x has frequencies i.e. £o->n-i) converges when n — > 00. 

Proof: Let \x be the unique S'-invariant measure for the associated subshift (X, S) . The 
caracteristic function X[u] °f M is continuous, so we have a uniform and therefore a 
pointwise convergence in Birkhoff's theorem : 

1 1 n_1 

-#(u,Xo-m-l) = -y2x[u](S k (x)) >MM) 

n n z — ' n-K» 

fc=0 

□ 
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We can remark that this proof has the same flavour as the the main result of |Bosj that 
asserts that every subshift with subaffine complexity has only a finite number of ergodic 
measures. In fact, TheoremEland Boshernitzan's result can both be deduced from a more 
general statement that involves the geometry of Rauzy graphs associated to a minimal 
subshift: 

To each minimal subshift (X, S) we can associate a sequence (G n )n>i °f oriented graphs 
as follows: the vertices of G n is L n (X) and there is an edge from u to v if and only if 
there exists w in L n+1 (X) such that u is a prefix of w and v is a suffix of w. 
Those graphs are named the Rauzy graphs associated to (X, S) . 

If K > 1, (X, S) is said to be K-deconnectable if there exists an extraction a G T(N*, N*) 
and a constant K' > 1 such that for all n > 1 there exists a subset D a ^ C L a ( n ) (X) of 
at most K vertices such that every path in G a ^{X) \ D a ( n ) is of length at most K'a(n) 
(in particular it do not contains any cycle). This means that, up to extraction, we can 
disconnect (in a specific way) the Rauzy graphs by removing at most K vertices. 

Theorem 6 QMonJ). A K-deconnectable minimal subshift has at most K S -invariant 
ergodic measures. 

This result implies Boshernitzan's one, by taking for D n the set of right special factors 
(i.e. the set of vertices having outgoing degree strictly greater than one). It also implies 
Theorem [HI since every multi-scale quasiperiodic subshift is 1-deconnectable. Indeed, if q 
is in Q(X), Gi(q)(X) \ {q} does not contain any path of length greater than l{q). 
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5 Sturmian subshifts are multi-scale quasiperiodic 



In |LevRicl] . Florence Leve and Gwenael Richomme proved that there exists sturmian 
words that are not quasiperiodic. In terms of complexity, sturmian words are the more 
symmetric words after periodic ones, so this result do not comfirm that quasiperiodicity 
fits well with other notions of symmetry. 

The dynamical point of view will allow us to solve the problem: 
Theorem 7. Sturmian subshifts are multi-scale quasiperiodic. 

Proof: Let (X, S) be a sturmian subshift. Since pi(X) = 2, we can consider that X is 
defined on the alphabet {a, b}. 

A word u G L n X is said to be left special if au and bu are in L n+ x(X). Since p n+ x(X) — 
p n (X) = 1, there exists exactly one left special word l n of length n (n > 0). A prefix of 
a left special word is still a left special word, so l n+ i begins with l n : let us denotes by 
x the infinite word that begins by l n for any integer n. Since L(x) C L(X), x is in X 
(remember that X is closed). We will prove that x is multi-scale quasiperiodic. 

The evolution of the Rauzy graphs of sturmian subshifts is described by Rauzy (see 
|ArnRau| ): for infinitely many n (named "bursts"), the Rauzy graph G n (X) is eight 
shaped i.e. G n (X) is the union of two disjoints loops from l n to l n . By minimality, the 
minimal size of the two loops tends to infinity with n, in particular, it is positive for n 
big enough. Since the sum of the lengths of the two loops is equal to p n (X) = n + 1, then 
each loop has size less or equal than n. 

So, for infinitely many n, any path in G n (X) of length n starting from l n has to meet l n 
again i.e. l n is a quasiperiod of x (x can be viewed as an infinite path in G n (X) starting 
from l n ). Therefore, (X, S) is a multi-scale quasiperiodic subshift. 

□ 

Recently, Florence Leve and Gwenael Richomme gave a precise description of the quasiperi- 
odic sturmian words f |LevR ic2j). 
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6 Conclusion 



The gain of fitting with other symmetry classes obtained by considering multi-scale 
quasiperiodic subshifts instead of quasiperiodic words can be sumarized as follow : 




Figure 1: Quasiperiodic words among other words. 




Figure 2: multi-scale quasiperiodic subshifts among other subshifts. 
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